Database Design For Corpus Storage: The ET10-63 Data Model

نویسندگان

  • Tony McEnery
  • Béatrice Daille
چکیده

Traditional le systems are the most common storage medium for corpora at present Within these corpora are stored as simple linear text les with only primitive attempts made at systematic organization For example the LOB corpus Johnasson Leech and Goodluck is stored as a series of les each le re ecting a particular textual genre Hence the only apparent organization within the le structure of LOB is a rough genre based division This imposes serious limits on the functionality of the system The corpus when stored should meet a goal of functional adequacy To enumerate the aspects of functional adequacy one should consider that

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stochastic Synthesis of Drouths for Reservoir Storage Design (RESEARCH NOTE).

Time series techniques are applied to Ghara-Aghaj flow records, in order to generate forecast values of the mean monthly river flows. The study of data and its correlogram shows the effect of seasonality and provide no evidence of trend. The autoregressive models of order one and two (AR1, AR2), moving average model of order one and ARMA (1,1) model are fitted to the stationary series, where th...

متن کامل

Towards a Data Model for the Universal Corpus

We describe the design of a comparable corpus that spans all of the world’s languages and facilitates large-scale cross-linguistic processing. This Universal Corpus consists of text collections aligned at the document and sentence level, multilingual wordlists, and a small set of morphological, lexical, and syntactic annotations. The design encompasses submission, storage, and access. Submissio...

متن کامل

Model-Driven Integration of Compression Algorithms in Column-Store Database Systems

Modern database systems are very often in the position to store their entire data in main memory. Aside from increased main memory capacities, a further driver for in-memory database systems was the shift to a decomposition storage model in combination with lightweight data compression algorithms. Using both mentioned storage design concepts, large datasets can be held and processed in main mem...

متن کامل

Curriculum Design in the flipped classroom: the research synthesis Methods

  Flipped classroom is a way to create positive changes in education; therefore, in the present study we tried to offer a comprehensive operating model of implementation of this method based on research synthesis. The corpus of this study consisted of all scientific articles published about the implementation of flipped classroom. From this corpus 1084 papers were identified through constant se...

متن کامل

Data Storage Technology

The data storage and retrieval services utilized by the ECS services layer will be provided by both a Database Management System (DBMS) and a hierarchical le system (see Figure 2.1). The DBMS(s) will manage and provide access to structured data as well as certain types of unstructured data (e.g., text data), whereas the le system will provide direct access to the large data sets that will be pr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993